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ABSTRACT 

In the multi-fractal analysis of large scale matter distribution, the scale of transition to ho- 
mogeneity is defined as the scale above which the fractal dimension (D q ) of underlying point 
distribution is equal to the ambient dimension (D) of the space in which points are distributed. 
With finite sized weakly clustered distribution of t racers obtained fro m galaxy redshift sur- 
veys it is difficult to achieve this equality. Recently Bagla et al. (2008) have defined the scale 
of homogeneity to be the scale above which the deviation (AD q ) of fractal dimension from 
the ambient dimension becomes smaller than the statistical dispersion of AD q , i.e., <JAD q - In 
this paper we use the relation between the fractal dimensions and the correlation function to 
compute <TAD q for any given model in the limit of weak clustering amplitude. We compare 
AD q and ctad, for the ACDM model and discuss the implication of this comparison for the 
expected scale of homogeneity in the concordant model of cosmology. We estimate the upper 
limit to the scale of homogeneity to be close to 260 h _1 Mpc for the ACDM model. Actual 
estimates of the scale of homogeneity should be smaller than this as we have considered only 
statistical contribution to cad, and we have ignored cosmic variance and contributions due 
to survey geometry and the selection function. Errors arising due to these factors enhance 
o~AD q and as AD q decreases with increasing scale, we expect to measure a smaller scale of 
homogeneity. We find that as long as non linear correction to the computation of AD q are 
insignificant, scale of homogeneity does not change with epoch. The scale of homogeneity 
depends very weakly on the choice of tracer of the density field. Thus the suggested definition 
of the scale of homogeneity is fairly robust. 

Key words: cosmology : theory, large scale structure of the universe — methods: statistical 



1 INTRODUCTION 



One of the primary aims of galaxy redshift surveys is to determine 
the d i stribution of lum i nous matter in the Universe Jkuchra et"al] 
19831: iLillv et aljll995l: IShectman et al.lll996l: iHuchra et alj | l999 |: 



York et alj 120001: iColless et all 120011: iGiavalisco et all 120041 : 
Le Fevre et alj|2005l " IScoville et alj|2007l) . These surveys have re 
vealed a large variety of structures starting from groups and clus- 
ters of galaxies, extending to super-clusters and an interconnected 
networ k of filaments which appears to ex t end across very large 
scales {Bond. Kofman. & Pogosvanl 1 19961 : [Springel et alj 120051 : 
Faucher-Giguere, Lid z. & Hernauistfl2008h . We expect the galaxy 
distribution to be homogeneous on large scales. In fact, the assump- 
tion of large scale homogeneity and isotropy of th e universe is the 
basis of most cosmological models ( Einsteinl l 1917t> . In addition to 
determining the large scale structures, the redshift surveys of galax- 
ies can also be used to verify whether the gal axy distribution does 



indeed become homogeneous at some scale( Einasto & Gramann 
19931: iMartinez et al.|[l998: Yad av et al.|[2005l : ISvlos Labini et al 



2009al 3). Fractal dimensions of the galaxy matter distribution can 
be used to test the conjecture of homogeneity. One advantage of 



using fractal dimensions over other analyses is that in the former 
we don't require the assumption of an average density in the point 
set Clones et al.||2004l) . 

When doing our analysis we often work with volume limited 
sub-samples extracted from the the full magnitude samples of the 
galaxies. This is done in order to avoid an explicit use of the selec- 
tion function. The volume limited sub-samples constructed in this 
manner from a flux limited parent sample naturally have a much 
smaller number of galaxies. This was found to be too restrictive for 
the earliest surveys and corrections for varying selection function 
we re used explicitly in order to determine the scale of h omogene- 
ity teharadwai et al.lll999l : rAmendola & Palladindl 19991) . But with 
modern galaxy redshift surveys, this limitation is less severe. 

Making a volume limited sub-sample requires assumption of 
a cosmological model and this may be thought of as an undesirable 
feature of data analysis. However, if we directly use raw data and 
do not account for a redshift dependent selection function in any 
way, it is obvious that the selection function will dominate in any 
large scale description of the distribution of galaxies. This is only 
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to be expected as we see only brighter galaxies at larger distances 
in a flux or magnitude limited sample. 

For a given survey one computes the fractal dimension of the 
point set under consideration and the scale beyond which the fractal 
dimension is equal to the ambient dimension of the space in which 
the particles are distributed is referred to as the scale of homo- 
geneity of that distribution. Mathematically the fractal dimension 
is defined for an infinite set of points. Given that the observational 
samples are finite there is a need to understand the deviations in 
the fractal dimensions arising due to the finite number of points. In 
practice we should identify a scale to be the scale of homogeneity 
where the rms error in the fractal dimension is comparable to or 
greater than th e deviation of the fractal dimension from the ambi- 
ent dimension ( iBagla et alj|2003) . It is not possible to distinguish 
between a given point set and a homogeneous point set with the 
same number of points in the same volume beyond the scale of 
homogeneity . 

In all practical cases of interest the fractal dimens ion is never 
equal to the ambient integer dimension of the space. IBagla et al.l 
(2008) have shown that these deviations in fractal dimension oc- 
cur mainly due to weak clustering present in the galaxy distribu- 
tion, with a smaller contribution arising due to the finite number of 
galaxies in the distribution. This work assumed the standard cos- 
mological model in order to derive a relation between the fractal 
dimension on one hand, and, a combination of number density of 
tracers of the density field and clustering on the other. It was as- 
sumed that the clustering is weak at scales of interest. In this paper, 
we revisit the expected deviation of fractal dimension for a finite 
distribu tion of weakly clus tered points and verify the relations de- 
rived in Ba gla et aT] ([2008). For this purpose we have used a parti- 
cle distribution from a large volume TV— Body simulations. Further, 
we generalise the relation between clustering and fractal dimension 
to compute the rms error in determination of fractal dimensions 
and clustering from data. We then discuss the implications of this 
for the expected scale of homogeneity in the standard cosmologi- 
cal model. We also comment on some recent determinations of the 
scale of homogeneity from observations of galaxy distribution in 
redshift surveys. 

The plan of the paper is as follows. In section §2 we present 
a brief introduction to fractals and fractal dimensions. Subsection 
§2.1 describes the fractal dimension for a weakly clustered distri- 
bution of finite number of points. Section §3 discusses the calcu- 
lation of variance in £ and hence the variance in offset to fractal 
dimension. We present the results in section §4, and conclude with 
a summary of the paper in §5. 



2 FRACTALS AND FRACTAL DIMENSIONS 

The name fracta l was introduced by Benoit B. Mandelbrot 
(Mandelbrot 1982) to characterise geometrical figures which may 
not be smooth or regular. One of the definitions of a fractal is that it 
is a shape made of parts similar to the whole in some way. It is use- 
ful to regard a fractal as a set of points that has properties such as 
those listed below, rather than to look for a more precise definition 
which will almost certainly exclude some intere sting cases. A set 
F is a fractal if it satisfies most of the following ( lFalconeij|2003l) : 

(i) F has a fine structure, i.e., detail on arbitrarily small scales. 

(ii) F is too irregular to be described in traditional geometrical 
language, both locally and globally. 

(iii) F has some form of self-similarity, perhaps approximate or 
statistical. 



(iv) In most cases of interest F is defined in a very simple way, 
perhaps recursively. 

Fractals are characterised using the so called fractal dimensions. 
These can be defined in different ways, which do not necessarily 
coincide with one another. Therefore, an important aspect of study- 
ing a fractal structure is the choice of a definition for fractal dimen- 
sion that best applies to the case in study. 

Fractal dimension provides a description of how much space a 
point set fills. It is also a measure of the prominence of the irregu- 
larities of a point set when viewed at a given scale. We may regard 
the dimension as an index of complexity. We can, in principle, use 
the concept of fractal dimensions to characterise any point set. 

The simplest definition of the fractal dimension is the so called 
Box counting dimension. Here we place a number of mutually ex- 
clusive boxes that cover the region in space containing the point set 
and count the number of boxes that contain some of the points of 
the fractal. The fraction of non-empty boxes clearly depends on the 
size of boxes. Box counting dimension of a fractal distribution is 
defined in terms of non empty boxes TV(r) of radius r required to 
cover the distribution. If 

TV(r) oc r D » (1) 

we define Db to be the box counting dimension 

(Barabasi & Stanley! 1 19951) . In general Df, is a function of 
scale. One of the difficulties with such a definition is that it does 
not depend on the number of particles inside the boxes and rather 
depends only on the number of boxes. It provides very limited 
information about the degree of dumpiness of the distribution and 
is more of a geometrical measure. To get more detailed information 
on clustering of the distribution we use the concept of correlation 
dimension. We have chosen the correlation d i mension, among 
variou s other definitions (see e.g. Borgani 1995; Martinez & Saar 
2002), due to its mathematical simplicity and the ease with which 
it can be adapted to calculations for a given point set. The formal 
definition of correlation dimension demands that the number of 
particles in the distribution should be infinite (e.g. TV — > oo in 
equation O. We use a working definition for a finite point set. 
Calculation of the correlation dimension requires the introduction 
of correlation integral given by 

M 
£=1 

Here we assume that we have TV" points in the distribution. M is the 
number of centres on which spheres of radius r have been placed 
with the requirement that the entire sphere lies inside the distribu- 
tion of points. Spheres are centred on points within the point set and 
it is clear that M < TV. Here m (r) denotes the number of particles 
within a distance r from i th point: 

N 

TK(r)=5^e(r- | a* -ay [) (3) 

where Q(x) is the Heaviside function. If we consider the distribu- 
tion function for the number of points inside such spherical cells, 
we can rewrite Ci in terms of the probability of finding n particles 
in a sphere of radius r. 

N 

C 2 (r) = i^nP(n;r,TV) (4) 

71=0 
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where P(n;r,N) is the normalised probability of getting n out 
of N points as neighbours inside a radius r of any of the points. 
The correlation dimension D2 of the distribution of points can 
be defined via the power law scaling of correlation integral, i.e., 

C 2 (r)xr D2 . 



D 2 (r) = 



d log C 2 (r) 



(5) 



d log r 

Since the scaling behaviour of C2 can be different at different 
scales, we expect the correlation dimension to be a function of 
scale. For the special case of a homogeneous distribution we see 
that the correlation dimension equals the ambient dimension for an 
infinite set of points. 

C2 (r) defined in the manner given by equation |4]provides the 
average of the distribution function of P(n\ r, TV). In order to char- 
acterise the distribution of points, we need information about all 
the moments of the distribution function. This leads us to the gen- 
eralised dimension D q , also known as Minkowski-Bouligand di- 
mension. This is a generalisation of the correlation dimension D2 . 
The correlation integral can be generalised to define C q {r) as 



C q {r) = T7 



1 \ ^ 



,9-1 



N 

n-0 



P(n) = -{W 



q—l\ 



which is used to define the Minkowski-Bouligand dimension 



D q (r) = 



rflogC g (r) 
dloer 



(6) 



(7) 



We expect the generalised dimension to be scale dependent in gen- 
eral. If the value of D q {r) is independent of both q as well as r then 
the point distribution is called a mono-fractal. 

If we are dealing with a finite number of points in a finite 
volume then we can always define an average density. This al- 
lows us to relate the generalised correlation integral with correla- 
tion functions. For q > 1, the generalised correlation integral and 
the Minkowski-Bouligand dimension can be related to a combina- 
tion of n-point correlation functions with the highest n equal to q 
and the smallest n equal to 2. The contribution to C q is dominated 
by regions of higher number density of points for g > 1 whereas 
for q <C the contribution is dominated by regions of very low 
number density. This clearly implies that the full spectrum of gen- 
eralised dimension gives us information about the entire distribu- 
tion: regions containing clusters of points as well as voids that have 
few points. 

This allows us to connect the concept of fractal dimensions 
with the statistical measures used to quantify the distribution of 
matter at large scales in the universe. In the situation where the 
galaxy distribution is homogeneous and isotropic on large scales, 
we intuitively expect D q ~ D — 3 independent of the value of 
q. Whereas at smaller scales we expect to see a spectrum of values 
for the fractal dimension, all different from 3. It is of considerable 
interest to find out the scale where we can consider the universe to 
be homogeneous. We address this issue using theoretical models in 
this paper. 

In an earlier paper we have derived a leading order expres- 
sion for generalised correlation integral for homogeneous as well 
as weakly clustered distributions of points ( Bagla et al. 200£j). We 
found that even for a homogeneous distribution of finite number 
of points the Minkowski-Bouligand dimension, D q (r), is not ex- 
actly equal to the ambient dimension, D. The difference in these 
two quantities arises due to discreteness effects. Hence for a finite 
sample of points, the correct benchmark is not D but the estimated 
value of D q for a homogeneous distribution of same number of 



points in the same volume. An interesting aside is that the correc- 
tion due to a finite size sample always leads to a smaller value for 
D q (r) than D. As expected, this correction is small if the average 
num ber of points in sph eres is large, i.e., N S> 1. 

lBaglaetal] [2008) have demonstrated that the general expres- 
sion for m th order moment of a weakly clustered distribution of 
points is given by: 



(AT), 



TV" 



(m) (m — 1) m(m + 1) - 
1 + . .-. H -£ 



-O (f 



o 



2N 

I 
N 



where 

N = nV & f(r) 



-3 / £(x)x 2 dx 



(8) 



(9) 



is the average number of particles in a randomly placed sphere and 
the volume averaged two point correlation function respectively. 

The generalised correlation integral (eq (6]l for this distribution 
can now be written as 



NC q (r) 



l + (g - 1) (q - 2) + q(q-l) f 



27V 2 



\N 2 J 



(10) 



The third term on the right hand side in fact encapsulates the con- 
tribution of clustering. This differs from the last term in the corre- 
sponding expression for a homogeneous distribution as in that case 
the "clustering" is only due to cells being centred at points whereas 
in this case the locations of every pair of points has a weak correla- 
tion. It is worth noting that the highest order term of order O (£ 2 ) 
has a factor O (q 3 ) and hence can become important for suffi- 
ciently large q. This may be quantified by stating that q£ <C 1 is the 
more relevant small parameter for this expansion. The Minkowski- 
Bouligand dimension for this system can be expressed as: 



D q (r) 



D [ I 
D I I 



D 1- 



(g-2) 
2N 

(9-2) 
2N 

(9-2) 



2 d log r 

qrd£ 
2 dr 



&D q (r) = 



2N "2^ W 
D - (AD q ) N - (AD q ) clus 
~(AD q ) N -(AD q ) clua 



(11) 



For a weakly clustered distribution we note that 



• For hierarchical clustering, both terms have the same sign and 
lead to a smaller value for D q as compared to D. 

• Unless the correlation function has a feature at some scale, 
smaller correlation corresponds to a smaller correction to the 
Minkowski-Bouligand dimension. 

• If the correlation function has a feature then it is possible to 
have a small correction term (AD q ) clus for a relatively large £. In 
such a situation, the relation between £ and (AD q ) clus is no longer 
one to one. In such a case (AD q ) clus does not vary monotonically 
with scale. 
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Figure 1. Compai'ison of AD4 calculated using equation|7](solid line), and 
estimated using equation II U dashed line) for the large volume N — Body 
simulation. 



The model described here has been validated w ith the 
multinomial-multi-fractal model (see e.g. IBagla et al.l l2008). In the 
following discussion, we validate our model with the help of a large 
volume N-Body simulation in order to check it in the setting where 
we wish to use it. 



2.1 iV-Body Simulations 

We use a the TreePM code for cosmological sim ulations (Bagla 
120021 : iBagla & Ravll2003l ; ItQiandai & Baglall2009l) to simulate the 
distribution of particles. The simulations were ru n with the set 
of co smological parameters favoured by WMAP5 ( Komat su et al.l 
120091) as the best fit for the ACDM class of models: Q„ r = 0.2565, 
fi A = 0.7435, n s = 0.963, cr 8 = 0.796, h = 0.719, and, 
Q. b h 2 = 0.02273. The simulations were done with 512 3 particles 
in a comoving cube of side 1024ft" 1 Mpc. 

We computed the two point correlation function £(r) directly 
from the N— Body simulation ou tput by using a subsample of parti- 
cles jPeeblesll980llKaiserl 1984). The volume averaged correlation 
function £(r) follows from £(r) using equation|9] 

Figure [T]presents the comparison between AD q estimated di- 
rectly using equation [7] from the N— Body simulation, and, the 
values computed using equation QT] For the last curve we use the 
values of £(r) and £(r) computed from the simulation. We find that 
the two curves track each other and the differences are less than 
10% at all scales larger than 60 h _1 Mpc. This is fairly impressive 
given that we have only taken the leading order contributions into 
account. This validates the relation between the correlation func- 
tions and the fractal dimensions in the limit of weak clustering. 
We find that as we go to larger scales, AD q becomes smaller but 
does not vanish. One may then ask, is there no scale where the uni- 
verse becomes homogeneous? The answer to this question lies in a 
comparison of the offset AD q with the dispersion expected due to 
statistical errors, we discuss this in detail in the following section. 



3 VARIANCE IN FRACTAL DIMENSION 

In this section we use the relation between the two point correla- 
tion function, number density of points and the fractal dimension to 
estimate the statistical er ror in determination of the fractal dimen- 
sion. We have shown in IBagla et all |2008) that for most tracers 
of the large scale density field in the universe, the contribution of 
the finite number density of points is much smaller than the con- 
tribution of clustering in terms of the deviation of fractal dimen- 
sion from the ambient dimension. In the following discussion we 
assume that the contribution of a finite number density of points 
can be ignored. With this assumption, we elevate the relation be- 
tween the fractal dimension and the two point correlation function 
to the dispersion of the two quantities. The statistical error in the 
Minkowski-Bouligand Dimension can then be estimated from the 
statistical error in the correlation function. 



Var{AD q } ~ Var{(AD q ) clus } 
This implies that 

Var{AD q } ~ Var{^ (£» - £(r))} 



(12) 



(Var{£(r)} + Var{^(r)} 



V 2 J 

+2<7ot;{f(r)£(r)}) (13) 

We can make use of the fact that \Cov(x, y) \ < a x a y where x and 
y are random variables. We get: 

fair) - a e (r)\ < < |a f (r) + <r ( (r)\ (14) 

If we find that one of the <7f (r) or a^(r) is much larger than the 
other then we get: 



(TAD q ^ ^ Max (o-f (r), cr ? (r)) 



(15) 



The problem is then reduced to the evaluation of statistical error in 
the correlation function. 

Starting point in the analytical estimate of the statistical error 
in £(r) is the assumption that the variance in the power spectrum 
is that expected for Gaussian fluctuations with a shot-noise com- 
ponent arisin g from the finite number of objec ts used to trace the 



ponent arising Irom tne nmte number ot objects 
density field jFeldman et al Jl 1 9941 ; [Smitrill2009h : 



ap(k) 



(16) 



where V is the simulation volume, and n is the mean d ensity of the 
object s considered (dark matter particles or halos). Ang ulo et al.l 
( 2008) found good agreement between this expression and the vari- 
ance in P(k) measured from numerical simulations. In order to 
develop a consistent approach, we use 1/n = in the following 
discussion. 

The covariance of the two-point correlation f unction is defined 
by teernsteirJll994l : ICohni r2006; Smi th et alj|2008l) : 



C e (r,r') 



dkk 2 
2tt 2 



0))(C(0-e>'))) 

jo(kr)jo(kr')a 2 P (k), 



(17) 



where the last term can be replaced by Eq. U6| >. The variance in 
the co rrelation function is simply <r|(r) = CV (r, r). iKazin et"al] 
(2009) have tested this formula against the variance derived from 
the mock catalogs for both dim and bright galaxy samples of SDSS, 
and found that at 50 < r < 100 h~ 1 Mpc the variance is consistent 
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with equation[T7] The direct application of Eq. d!7t leads to a sub- 
stantial over prediction of the variance, since it ignores the effect 
of binning in pair separat i on which reduces the covariance in the 
measurement dCohrfeOodlSmifh et al]|2008l) . 

An estimate of the correlation function in the i th pair separa- 
tion bin corresponds to the shell averaged correlation function 



(18) 



where Vi is the volume of the shell. The covar iance of this estimate 
is given by the following expression, e.g., see dSanchez et al.l2008h 



where 



dk k 2 -. 



2tt 



d 3 r I dVC ? (r,r') 
2 3o(k,i)jo(k,j)aj>(k), 



jo(k,i) = i J j (kr)d 3 r. 



(19) 



(20) 



is the volume averaged Bessel function. So the variance in £ is 
a|(r) = C|(i, i). We find that at scales of interest Cf(r) <C 
From here, it is straightforward to compute the standard deviation 
in AD q using equation 1731 



4 SCALE OF HOMOGENEITY 

We can describe a distribution of points to be homogeneous if the 
standard deviation of AD q is greater than AD q . Note that this for- 
mulation gives us a unique scale: above this scale it is not possible 
to distinguish between the given point distribution and a homoge- 
neous distribution. In the cosmological context, we have bypassed 
the details of contribution to the variance arising from the survey 
geometry, survey size, etc. Clearly, if we were to take these con- 
tributions into account, the error in determination of AD q will be 
larger and hence w e will recover a smaller scale of homogeneity 
dSarkar et alj2009h . This change results not from any change in the 
real scale of homogeneity but because of the limitations of observa- 
tions. In our view, the real scale of homogeneity is one where these 
limitations do not matter. In the limit where we ignore these addi- 
tional sources of errors, we have the following general conclusions: 

• As long as non-linear correction are not important, the scale 
of homogeneity does not change with epoch. 

• In real space, the scale of homogeneity is independent of the 
tracer used, because the deviation AD q as well as the dispersion in 
this quantity scale in the same manner with bias. This follows from 
the fact that bias in correlation function can be approximate d by a 
constant number a t a given epoch at sufficiently large scalesl Bagla 
1 1 998L IS eli akl2OO0t) . 

• Redshift space distortions introduce some bias dependence in 
the scale of homogeneity. 

• As long as our assumption of q£ <C 1 is valid, the scale of 
homogeneity is the same for all q. 

These points highlight the robust and unique nature of the scale of 
homogeneity defined in the manner proposed in this work. 

We now turn to more specific conclusions in the cosmological 
setting below. We have calculated ctad, and AD q using the power 
spectrum of a large N-body simulation as well as using the lin- 
ear power spectrum obtained from WMAP5 parameters. We have 
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Figure 2. Variation of AD q and its predicted standard deviation with scale 
is shown in these plots. The top panel shows these for the ACDM simula- 
tion described in the text, the middle panel shows the same using the linearly 
evolved power spectrum and the lower panel again shows the same quantity 
with data from simulations patched with the linearly evolved power spec- 
tram at large scales. AD q /(0.5qD) is shown using a solid curve in each 
panel, the dashed line shows the dispersion a&D q /(0.5gD) and the dot- 
dashed line shows 2a/^£> q /(0.5qD). The intersection of AD q /(0.5qD) 
and &/\£) q /(0.5qD) is the scale beyond which we cannot distinguish be- 
tween the ACDM model and a homogeneous distribution. 
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summarised these findings in Figure [2] The top panel in the fig- 
ure shows the deviation and the dispersion as estimated from an 
N-Body simulation. In this case the estimated scale of homogene- 
ity is just above 320 h _1 Mpc. The corresponding calculation with 
the linear theory gives a scale of just over 280 h~ 1 Mpc. Part of 
the reason for this difference is that the simulation does not con- 
tain perturbations at very large scales. If we patch the power spec- 
trum derived from simulations with the linearly extrapolated power 
spectrum of fluctuations at these scales then we get 260 h _1 Mpc as 
the scale of homogeneity, broadly consistent with the value derived 
from linear theory. Curves for this last estimate are shown in the 
lowest panel of Figure[2] If we compare AD q with 2oAD q instead, 
then we get scales of 230, 215 and 220 h _1 Mpc, respectively, in the 
three cases. As expected, increasing the error or dispersion leads to 
a smaller scale of homogeneity. Once again, the answers derived 
using different approaches are broadly consistent with each other 
and even the small deviations can be understood i n terms of generic 
aspects of non-linear evolution of perturbations ( Bharadwaj 1996; 
Bagla & Padmanabhanlll997l ; lEisenstein. Seo. & Whitell2007l) ! 



5 SUMMARY 

In this paper we have verified the relation between the two point 
correlation function and fractal dimensions for a simulated dis- 
tribution of matte r at very large scales. This relation indicates 
(Bagl a et al. 2008) that if we use a strongly biased tracer, such as 
Luminous Red Galaxies or clusters of galaxies then the deviation of 
the fract al dimension from 3 is larger. This is consistent with obser- 
vations l lEinasto et alj [l997). It is also worthwhile to mention that 
several observations ha ve reported detection of excess clustering a t 
a scale of 120 h _1 Mpc dBroadhurst et alJl990hlEinasto et al.ll997l) 
and distances between the largest overdensities have been observed 
to exceed the s cale of homogeneity derived from most observations 
l lEinastoll2009h . 

In this paper, we have also used the relation to estimate statis- 
tical uncertainty in determination of fractal dimensions. The scale 
of homogeneity is then taken to be the scale where this uncertainty 
is the same as the offset of fractal dimension from 3. We show ex- 
plicitly that the uncertainty scales with bias in the same manner as 
the offset of fractal dimension, thus the true scale of homogene- 
ity is not sensitive to which objects are used as long as the survey 
volume is large enough to contain a sufficient number. The spirit 
of this paper is to estimate the fractal dimensions for an ideal ob- 
servation, and not to worry about the limitations of current obser- 
vations. The connection between the fractal dimension and corre- 
lation function allows us to make this leap in the limit of weak 
clustering that is clearly applicable at large scales. Applying this to 
a cosmological situation with the model favoured by WMAP-5 ob- 
servations, we have estimated the scale of homogeneity to be close 
to 260h _1 Mpc. As we have ignored several sources of uncertainty 
that are likely to be present in most observational data sets, this es- 
timated scale of homogeneity is in some sense the upper limit of 
what can be determined as the scale of homogeneity from observa- 
tions. It is comforting to note that the scale of homogeneity is much 
smaller than the Hubble scale. 

An attractive feature of this way of defining the scale of homo- 
geneity is that it can be defined self consistently within the setting 
of the cosmological model with density perturbations. Further, this 
scale is independent of epoch and largely independent of the choice 
of tracer for the large scale density field in the universe. The scale 
of homogeneity is the same for the entire spectrum of fractal di- 



mensions, within the constraints of the underlying assumption that 
q |f | <C 1, making this a unique scale in the problem. 

A comparison with recent determination of the scale of ho- 
mogeneity from observations is pertinent. Observational analyses 
have shown that the scale of homogeneity may be as small as 
60 - 70 h _1 Mpc dYadav et alJiobl i lHogg et alj2005l : ISarkar et al.l 
2009). This is much smaller than the scale of homogeneity we have 
found using our method. However, we have ignored the effect of 
survey geometry and size, and hence in the analysis of any obser- 
vational dataset there are additional contributions to (JAD q - Any 
increase in the value of <JAD q leads to a smaller scale of homo- 
geneity. In this sense our estimate is the ideal scale of homogeneity 
and may be treated as an upper limit. Note that the additional con- 
tributions to g AD q are required to increase its value by more than 
an order of magnitude above our determination for the scale of ho- 
mogeneity to be as small as 70 h~ 1 Mpc. It should be possible to 
demonstrate consistency with our calculation through an explicit 
estimate of errors in observational survey. In the long term we ex- 
pect that an increase in survey depth and size will gradually lead to 
lowering of errors from other sources and we should obtain a larger 
scale of homogeneity. 
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